Sonority Measure for Automatic Speech Recognition

نویسنده

  • Daniil A. Kocharov
چکیده

In this paper, the use of sonority measure as an acoustic feature of the speech signal for continuous automatic speech recognition is described. The representation of sonority extent of sounds is made with a help of spectrum derivation. Therefore, a novel articulatory motivated acoustic feature expressing the sonority is named spectrum derivative feature. The new feature is tested in combination with the state-ofthe-art Mel Frequency Cepstral Coefficients (MFCC) feature. The effects of various warping and filtering techniques on the spectrum derivative feature are investigated. Experiments have been performed on the large vocabulary task (VerbMobil II corpus). Improvement in word error rate has been obtained by combining the MFCC feature with the spectrum derivative feature: of up to 4.5% on the large-vocabulary task (VerbMobil II corpus) relative to using MFCC alone with the same overall number of parameters in the system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...

متن کامل

Sonority Contours in Speech Recognition

for their invaluable input into this paper. All errors are my own. The sonority scale that ranks phonemes according to relative " loudness " has long played a significant role in the fields of Phonology and Historical Linguistics, yet it is conspicuously absent from the speech recognition literature. In this preliminary study using the Hoosier Mental Lexicon, it was found that approximately hal...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

Spike-v: an Adaptive Mechanism for Speech-rate Independent Timing

A neuronal model intended to target highly sonorant periods of a speech stream is presented. The model—“Spike-V”—uses habituation and Hebbian learning in opposition to each other to dynamically adjust its behavior. Acting in realtime, driven by only the signal, Spike-V produces a spike-train in which each spike corresponds to roughly the center of a period of high sonority (i.e. a vowel) in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006